Sparsification Algorithm for Cut Problems on Semi-streaming Model

Authors

  • Kook Jin Ahn
  • Sudipto Guha
Abstract

The emergence of social networks and other interaction networks has brought to the fore the question of processing massive graphs. The (semi-)streaming model, in which the available space is assumed to be (near) linear in the number of vertices (but not necessarily in the number of edges), is a useful and efficient model for processing large graphs. In many of these graphs the number of vertices is significantly smaller than the number of edges, which makes the semi-streaming model a natural fit. We focus on the problem of graph sparsification in a single pass, that is, constructing a small-space representation of the graph from which the size of any cut can be estimated. Graph sparsification is a major building block in a variety of algorithms, and there is a long history of (non-streaming) sampling algorithms that provide sparse approximations. The space requirement for graph sparsification is thus a natural question. Since Ω(n) space is necessary for a one-pass streaming algorithm to determine whether a graph is connected, there is an Ω(n) lower bound for any sparsification algorithm that approximates cuts multiplicatively. We show an essentially tight upper bound: using Õ(n/ε²) space, we can create a sparsification in a single pass that approximates each cut to within a (1 ± ε) factor.

Affiliation (both authors): Department of Computer and Information Sciences, University of Pennsylvania, Philadelphia PA 19104-6389.
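The (1 ± ε) cut guarantee stated in the abstract can be made concrete with a small check. The sketch below is not the paper's streaming algorithm; it only illustrates the definition by brute force on a tiny graph. The function names `cut_weight` and `is_cut_sparsifier`, the edge-list representation `(u, v, weight)`, and the example graphs are all assumptions introduced here for illustration.

```python
from itertools import combinations


def cut_weight(edges, side):
    """Total weight of edges with exactly one endpoint in `side`."""
    side = set(side)
    return sum(w for u, v, w in edges if (u in side) != (v in side))


def is_cut_sparsifier(graph, sparse, n, eps):
    """Check that every cut of `sparse` is within (1 +/- eps) of the same cut in `graph`.

    Brute force over vertex subsets of {0, ..., n-1}; exponential in n,
    so this is only meant for tiny illustrative inputs.
    """
    # Subsets of size up to n // 2 cover every cut, since a set and its
    # complement define the same cut.
    for k in range(1, n // 2 + 1):
        for side in combinations(range(n), k):
            g = cut_weight(graph, side)
            h = cut_weight(sparse, side)
            if not (1 - eps) * g <= h <= (1 + eps) * g:
                return False
    return True


# Hypothetical example: the complete graph K4 with unit weights (6 edges),
# approximated by a reweighted 4-cycle (4 edges of weight 1.5).
k4 = [(u, v, 1.0) for u, v in combinations(range(4), 2)]
cycle = [(0, 1, 1.5), (1, 2, 1.5), (2, 3, 1.5), (3, 0, 1.5)]

print(is_cut_sparsifier(k4, cycle, 4, 0.6))  # True
print(is_cut_sparsifier(k4, cycle, 4, 0.2))  # False
```

In this example the reweighted cycle matches every single-vertex cut of K4 exactly (weight 3), but its two-vertex cuts have weight 3 or 6 against a true value of 4, so it qualifies for ε = 0.6 and fails for ε = 0.2.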

Related papers

Graph Sparsification in the Semi-streaming Model

Analyzing massive data sets has been one of the key motivations for studying streaming algorithms. In recent years, there has been significant progress in analysing distributions in a streaming setting, but the progress on graph problems has been limited. A main reason for this has been the existence of linear space lower bounds for even simple problems such as determining the connectedness of ...

Analyzing Massive Graphs in the Semi-streaming Model

Massive graphs arise in many scenarios, for example, traffic data analysis in large networks, large-scale scientific experiments, and clustering of large data sets. The semi-streaming model was proposed for processing massive graphs. In the semi-streaming model, we have random-access memory whose size is near-linear in the number of vertices. The input graph (or equivalently, edges in the gr...

Graph Sparsification via Refinement Sampling

A graph G'(V,E') is an ε-sparsification of G for some ε > 0 if every (weighted) cut in G' is within (1 ± ε) of the corresponding cut in G. A celebrated result of Benczúr and Karger shows that for every undirected graph G, an ε-sparsification with O(n log n/ε²) edges can be constructed in O(m log² n) time. The notion of cut-preserving graph sparsification has played an important role in speeding up ...

A Framework for Analyzing Resparsification Algorithms

A spectral sparsifier of a graph G is a sparser graph H that approximately preserves the quadratic form of G, i.e., for all vectors x, x^T L_G x ≈ x^T L_H x, where L_G and L_H denote the respective graph Laplacians. Spectral sparsifiers generalize cut sparsifiers, and have found many applications in designing graph algorithms. In recent years, there has been interest in computing spectral sparsifiers in ...

Graph Sparsification in the Semi-streaming Model

Semi-streaming model. The title of the paper is “Graph Sparsification in the Semi-streaming Model”. First, we should note that it says semi-streaming and not streaming. In graph problems there is a linear space lower bound even for simple problems such as determining the connectedness of a graph. In other words, we have to store at least all vertices of the graph. Because of that, this pap...

Publication date: 2009